Overview

Dataset statistics

Number of variables25
Number of observations103904
Missing cells310
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.8 MiB
Average record size in memory200.0 B

Variable types

Numeric19
Categorical6

Warnings

Inflight wifi service is highly correlated with Ease of Online bookingHigh correlation
Ease of Online booking is highly correlated with Inflight wifi serviceHigh correlation
Food and drink is highly correlated with Seat comfort and 2 other fieldsHigh correlation
Seat comfort is highly correlated with Food and drink and 2 other fieldsHigh correlation
Inflight entertainment is highly correlated with Food and drink and 2 other fieldsHigh correlation
On-board service is highly correlated with Baggage handling and 1 other fieldsHigh correlation
Baggage handling is highly correlated with On-board service and 1 other fieldsHigh correlation
Inflight service is highly correlated with On-board service and 1 other fieldsHigh correlation
Cleanliness is highly correlated with Food and drink and 2 other fieldsHigh correlation
Departure Delay in Minutes is highly correlated with Arrival Delay in MinutesHigh correlation
Arrival Delay in Minutes is highly correlated with Departure Delay in MinutesHigh correlation
Inflight wifi service is highly correlated with Ease of Online bookingHigh correlation
Ease of Online booking is highly correlated with Inflight wifi serviceHigh correlation
Food and drink is highly correlated with Seat comfort and 2 other fieldsHigh correlation
Seat comfort is highly correlated with Food and drink and 2 other fieldsHigh correlation
Inflight entertainment is highly correlated with Food and drink and 2 other fieldsHigh correlation
On-board service is highly correlated with Baggage handling and 1 other fieldsHigh correlation
Baggage handling is highly correlated with On-board service and 1 other fieldsHigh correlation
Inflight service is highly correlated with On-board service and 1 other fieldsHigh correlation
Cleanliness is highly correlated with Food and drink and 2 other fieldsHigh correlation
Departure Delay in Minutes is highly correlated with Arrival Delay in MinutesHigh correlation
Arrival Delay in Minutes is highly correlated with Departure Delay in MinutesHigh correlation
Inflight wifi service is highly correlated with Ease of Online bookingHigh correlation
Ease of Online booking is highly correlated with Inflight wifi serviceHigh correlation
Food and drink is highly correlated with Inflight entertainment and 1 other fieldsHigh correlation
Seat comfort is highly correlated with Inflight entertainment and 1 other fieldsHigh correlation
Inflight entertainment is highly correlated with Food and drink and 2 other fieldsHigh correlation
On-board service is highly correlated with Inflight serviceHigh correlation
Baggage handling is highly correlated with Inflight serviceHigh correlation
Inflight service is highly correlated with On-board service and 1 other fieldsHigh correlation
Cleanliness is highly correlated with Food and drink and 2 other fieldsHigh correlation
Departure Delay in Minutes is highly correlated with Arrival Delay in MinutesHigh correlation
Arrival Delay in Minutes is highly correlated with Departure Delay in MinutesHigh correlation
Seat comfort is highly correlated with Cleanliness and 5 other fieldsHigh correlation
Cleanliness is highly correlated with Seat comfort and 3 other fieldsHigh correlation
Inflight wifi service is highly correlated with Gate location and 4 other fieldsHigh correlation
Food and drink is highly correlated with Seat comfort and 2 other fieldsHigh correlation
Gate location is highly correlated with Inflight wifi service and 2 other fieldsHigh correlation
satisfaction is highly correlated with Seat comfort and 4 other fieldsHigh correlation
Arrival Delay in Minutes is highly correlated with Departure Delay in MinutesHigh correlation
On-board service is highly correlated with Leg room service and 3 other fieldsHigh correlation
Online boarding is highly correlated with Seat comfort and 5 other fieldsHigh correlation
Leg room service is highly correlated with On-board service and 2 other fieldsHigh correlation
Departure/Arrival time convenient is highly correlated with Inflight wifi service and 2 other fieldsHigh correlation
Type of Travel is highly correlated with satisfactionHigh correlation
Ease of Online booking is highly correlated with Inflight wifi service and 3 other fieldsHigh correlation
Inflight entertainment is highly correlated with Seat comfort and 6 other fieldsHigh correlation
Baggage handling is highly correlated with On-board service and 1 other fieldsHigh correlation
Departure Delay in Minutes is highly correlated with Arrival Delay in MinutesHigh correlation
Class is highly correlated with Online boardingHigh correlation
Checkin service is highly correlated with Seat comfortHigh correlation
Inflight service is highly correlated with On-board service and 3 other fieldsHigh correlation
satisfaction is highly correlated with ClassHigh correlation
Class is highly correlated with satisfaction and 1 other fieldsHigh correlation
Type of Travel is highly correlated with ClassHigh correlation
Unnamed: 0 is uniformly distributed Uniform
id is uniformly distributed Uniform
Unnamed: 0 has unique values Unique
id has unique values Unique
Inflight wifi service has 3103 (3.0%) zeros Zeros
Departure/Arrival time convenient has 5300 (5.1%) zeros Zeros
Ease of Online booking has 4487 (4.3%) zeros Zeros
Online boarding has 2428 (2.3%) zeros Zeros
Departure Delay in Minutes has 58668 (56.5%) zeros Zeros
Arrival Delay in Minutes has 58159 (56.0%) zeros Zeros

Reproduction

Analysis started2021-08-16 10:48:52.868611
Analysis finished2021-08-16 10:50:24.552396
Duration1 minute and 31.68 seconds
Software versionpandas-profiling v3.0.0
Download configurationconfig.json

Variables

Unnamed: 0
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct103904
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51951.5
Minimum0
Maximum103903
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile5195.15
Q125975.75
median51951.5
Q377927.25
95-th percentile98707.85
Maximum103903
Range103903
Interquartile range (IQR)51951.5

Descriptive statistics

Standard deviation29994.64552
Coefficient of variation (CV)0.5773586041
Kurtosis-1.2
Mean51951.5
Median Absolute Deviation (MAD)25976
Skewness0
Sum5397968656
Variance899678760
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20471
 
< 0.1%
299881
 
< 0.1%
811651
 
< 0.1%
750221
 
< 0.1%
770711
 
< 0.1%
1037041
 
< 0.1%
996101
 
< 0.1%
1016591
 
< 0.1%
217921
 
< 0.1%
238411
 
< 0.1%
Other values (103894)103894
> 99.9%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
ValueCountFrequency (%)
1039031
< 0.1%
1039021
< 0.1%
1039011
< 0.1%
1039001
< 0.1%
1038991
< 0.1%
1038981
< 0.1%
1038971
< 0.1%
1038961
< 0.1%
1038951
< 0.1%
1038941
< 0.1%

id
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct103904
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64924.2105
Minimum1
Maximum129880
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum1
5-th percentile6593.15
Q132533.75
median64856.5
Q397368.25
95-th percentile123409.7
Maximum129880
Range129879
Interquartile range (IQR)64834.5

Descriptive statistics

Standard deviation37463.81225
Coefficient of variation (CV)0.5770391655
Kurtosis-1.198440096
Mean64924.2105
Median Absolute Deviation (MAD)32410
Skewness0.002864248253
Sum6745885168
Variance1403537228
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40941
 
< 0.1%
830741
 
< 0.1%
522751
 
< 0.1%
625161
 
< 0.1%
645651
 
< 0.1%
584221
 
< 0.1%
604711
 
< 0.1%
338501
 
< 0.1%
358991
 
< 0.1%
461401
 
< 0.1%
Other values (103894)103894
> 99.9%
ValueCountFrequency (%)
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
101
< 0.1%
ValueCountFrequency (%)
1298801
< 0.1%
1298791
< 0.1%
1298781
< 0.1%
1298751
< 0.1%
1298741
< 0.1%
1298731
< 0.1%
1298711
< 0.1%
1298701
< 0.1%
1298691
< 0.1%
1298671
< 0.1%

Gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
Female
52727 
Male
51177 

Length

Max length6
Median length6
Mean length5.014917616
Min length4

Characters and Unicode

Total characters521070
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMale
2nd rowMale
3rd rowFemale
4th rowFemale
5th rowMale

Common Values

ValueCountFrequency (%)
Female52727
50.7%
Male51177
49.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
female52727
50.7%
male51177
49.3%

Most occurring characters

ValueCountFrequency (%)
e156631
30.1%
a103904
19.9%
l103904
19.9%
F52727
 
10.1%
m52727
 
10.1%
M51177
 
9.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter417166
80.1%
Uppercase Letter103904
 
19.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e156631
37.5%
a103904
24.9%
l103904
24.9%
m52727
 
12.6%
Uppercase Letter
ValueCountFrequency (%)
F52727
50.7%
M51177
49.3%

Most occurring scripts

ValueCountFrequency (%)
Latin521070
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e156631
30.1%
a103904
19.9%
l103904
19.9%
F52727
 
10.1%
m52727
 
10.1%
M51177
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII521070
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e156631
30.1%
a103904
19.9%
l103904
19.9%
F52727
 
10.1%
m52727
 
10.1%
M51177
 
9.8%

Customer Type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
Loyal Customer
84923 
disloyal Customer
18981 

Length

Max length17
Median length14
Mean length14.54803472
Min length14

Characters and Unicode

Total characters1511599
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLoyal Customer
2nd rowdisloyal Customer
3rd rowLoyal Customer
4th rowLoyal Customer
5th rowLoyal Customer

Common Values

ValueCountFrequency (%)
Loyal Customer84923
81.7%
disloyal Customer18981
 
18.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
customer103904
50.0%
loyal84923
40.9%
disloyal18981
 
9.1%

Most occurring characters

ValueCountFrequency (%)
o207808
13.7%
l122885
 
8.1%
s122885
 
8.1%
y103904
 
6.9%
a103904
 
6.9%
103904
 
6.9%
C103904
 
6.9%
u103904
 
6.9%
t103904
 
6.9%
m103904
 
6.9%
Other values (5)330693
21.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1218868
80.6%
Uppercase Letter188827
 
12.5%
Space Separator103904
 
6.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o207808
17.0%
l122885
10.1%
s122885
10.1%
y103904
8.5%
a103904
8.5%
u103904
8.5%
t103904
8.5%
m103904
8.5%
e103904
8.5%
r103904
8.5%
Other values (2)37962
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
C103904
55.0%
L84923
45.0%
Space Separator
ValueCountFrequency (%)
103904
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1407695
93.1%
Common103904
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
o207808
14.8%
l122885
8.7%
s122885
8.7%
y103904
7.4%
a103904
7.4%
C103904
7.4%
u103904
7.4%
t103904
7.4%
m103904
7.4%
e103904
7.4%
Other values (4)226789
16.1%
Common
ValueCountFrequency (%)
103904
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1511599
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o207808
13.7%
l122885
 
8.1%
s122885
 
8.1%
y103904
 
6.9%
a103904
 
6.9%
103904
 
6.9%
C103904
 
6.9%
u103904
 
6.9%
t103904
 
6.9%
m103904
 
6.9%
Other values (5)330693
21.9%

Age
Real number (ℝ≥0)

Distinct75
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.37970627
Minimum7
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum7
5-th percentile14
Q127
median40
Q351
95-th percentile64
Maximum85
Range78
Interquartile range (IQR)24

Descriptive statistics

Standard deviation15.1149637
Coefficient of variation (CV)0.3838262174
Kurtosis-0.7195681169
Mean39.37970627
Median Absolute Deviation (MAD)12
Skewness-0.004516127072
Sum4091709
Variance228.4621276
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
392969
 
2.9%
252798
 
2.7%
402574
 
2.5%
442482
 
2.4%
422457
 
2.4%
412456
 
2.4%
222351
 
2.3%
232346
 
2.3%
452339
 
2.3%
472329
 
2.2%
Other values (65)78803
75.8%
ValueCountFrequency (%)
7562
0.5%
8640
0.6%
9692
0.7%
10683
0.7%
11678
0.7%
12635
0.6%
13633
0.6%
14707
0.7%
15818
0.8%
16899
0.9%
ValueCountFrequency (%)
8517
 
< 0.1%
8078
 
0.1%
7942
 
< 0.1%
7833
 
< 0.1%
7787
0.1%
7645
 
< 0.1%
7561
 
0.1%
7447
 
< 0.1%
7351
 
< 0.1%
72201
0.2%

Type of Travel
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
Business travel
71655 
Personal Travel
32249 

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters1558560
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPersonal Travel
2nd rowBusiness travel
3rd rowBusiness travel
4th rowBusiness travel
5th rowBusiness travel

Common Values

ValueCountFrequency (%)
Business travel71655
69.0%
Personal Travel32249
31.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
travel103904
50.0%
business71655
34.5%
personal32249
 
15.5%

Most occurring characters

ValueCountFrequency (%)
s247214
15.9%
e207808
13.3%
r136153
8.7%
a136153
8.7%
l136153
8.7%
n103904
6.7%
103904
6.7%
v103904
6.7%
B71655
 
4.6%
u71655
 
4.6%
Other values (5)240057
15.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1318503
84.6%
Uppercase Letter136153
 
8.7%
Space Separator103904
 
6.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s247214
18.7%
e207808
15.8%
r136153
10.3%
a136153
10.3%
l136153
10.3%
n103904
7.9%
v103904
7.9%
u71655
 
5.4%
i71655
 
5.4%
t71655
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
B71655
52.6%
P32249
23.7%
T32249
23.7%
Space Separator
ValueCountFrequency (%)
103904
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1454656
93.3%
Common103904
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
s247214
17.0%
e207808
14.3%
r136153
9.4%
a136153
9.4%
l136153
9.4%
n103904
7.1%
v103904
7.1%
B71655
 
4.9%
u71655
 
4.9%
i71655
 
4.9%
Other values (4)168402
11.6%
Common
ValueCountFrequency (%)
103904
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1558560
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s247214
15.9%
e207808
13.3%
r136153
8.7%
a136153
8.7%
l136153
8.7%
n103904
6.7%
103904
6.7%
v103904
6.7%
B71655
 
4.6%
u71655
 
4.6%
Other values (5)240057
15.4%

Class
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
Business
49665 
Eco
46745 
Eco Plus
7494 

Length

Max length8
Median length8
Mean length5.750567832
Min length3

Characters and Unicode

Total characters597507
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEco Plus
2nd rowBusiness
3rd rowBusiness
4th rowBusiness
5th rowBusiness

Common Values

ValueCountFrequency (%)
Business49665
47.8%
Eco46745
45.0%
Eco Plus7494
 
7.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
eco54239
48.7%
business49665
44.6%
plus7494
 
6.7%

Most occurring characters

ValueCountFrequency (%)
s156489
26.2%
u57159
 
9.6%
E54239
 
9.1%
c54239
 
9.1%
o54239
 
9.1%
B49665
 
8.3%
i49665
 
8.3%
n49665
 
8.3%
e49665
 
8.3%
7494
 
1.3%
Other values (2)14988
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter478615
80.1%
Uppercase Letter111398
 
18.6%
Space Separator7494
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s156489
32.7%
u57159
 
11.9%
c54239
 
11.3%
o54239
 
11.3%
i49665
 
10.4%
n49665
 
10.4%
e49665
 
10.4%
l7494
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
E54239
48.7%
B49665
44.6%
P7494
 
6.7%
Space Separator
ValueCountFrequency (%)
7494
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin590013
98.7%
Common7494
 
1.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
s156489
26.5%
u57159
 
9.7%
E54239
 
9.2%
c54239
 
9.2%
o54239
 
9.2%
B49665
 
8.4%
i49665
 
8.4%
n49665
 
8.4%
e49665
 
8.4%
P7494
 
1.3%
Common
ValueCountFrequency (%)
7494
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII597507
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s156489
26.2%
u57159
 
9.6%
E54239
 
9.1%
c54239
 
9.1%
o54239
 
9.1%
B49665
 
8.3%
i49665
 
8.3%
n49665
 
8.3%
e49665
 
8.3%
7494
 
1.3%
Other values (2)14988
 
2.5%

Flight Distance
Real number (ℝ≥0)

Distinct3802
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1189.448375
Minimum31
Maximum4983
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum31
5-th percentile175
Q1414
median843
Q31743
95-th percentile3383
Maximum4983
Range4952
Interquartile range (IQR)1329

Descriptive statistics

Standard deviation997.1472805
Coefficient of variation (CV)0.838327498
Kurtosis0.2685354395
Mean1189.448375
Median Absolute Deviation (MAD)517
Skewness1.109465668
Sum123588444
Variance994302.6991
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
337660
 
0.6%
594395
 
0.4%
404392
 
0.4%
2475369
 
0.4%
862369
 
0.4%
447362
 
0.3%
236351
 
0.3%
192333
 
0.3%
399332
 
0.3%
308329
 
0.3%
Other values (3792)100012
96.3%
ValueCountFrequency (%)
318
 
< 0.1%
568
 
< 0.1%
67128
0.1%
7359
0.1%
7430
 
< 0.1%
761
 
< 0.1%
7741
 
< 0.1%
7830
 
< 0.1%
802
 
< 0.1%
827
 
< 0.1%
ValueCountFrequency (%)
498312
< 0.1%
496313
< 0.1%
48175
 
< 0.1%
450210
< 0.1%
424318
< 0.1%
400011
< 0.1%
39995
 
< 0.1%
39988
< 0.1%
39979
< 0.1%
39968
< 0.1%

Inflight wifi service
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.729683169
Minimum0
Maximum5
Zeros3103
Zeros (%)3.0%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.327829471
Coefficient of variation (CV)0.4864408757
Kurtosis-0.8461697189
Mean2.729683169
Median Absolute Deviation (MAD)1
Skewness0.04040802158
Sum283625
Variance1.763131105
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
325868
24.9%
225830
24.9%
419794
19.1%
117840
17.2%
511469
11.0%
03103
 
3.0%
ValueCountFrequency (%)
03103
 
3.0%
117840
17.2%
225830
24.9%
325868
24.9%
419794
19.1%
511469
11.0%
ValueCountFrequency (%)
511469
11.0%
419794
19.1%
325868
24.9%
225830
24.9%
117840
17.2%
03103
 
3.0%

Departure/Arrival time convenient
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.060296043
Minimum0
Maximum5
Zeros5300
Zeros (%)5.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.525075197
Coefficient of variation (CV)0.4983423748
Kurtosis-1.037767284
Mean3.060296043
Median Absolute Deviation (MAD)1
Skewness-0.3343986322
Sum317977
Variance2.325854357
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
425546
24.6%
522403
21.6%
317966
17.3%
217191
16.5%
115498
14.9%
05300
 
5.1%
ValueCountFrequency (%)
05300
 
5.1%
115498
14.9%
217191
16.5%
317966
17.3%
425546
24.6%
522403
21.6%
ValueCountFrequency (%)
522403
21.6%
425546
24.6%
317966
17.3%
217191
16.5%
115498
14.9%
05300
 
5.1%

Ease of Online booking
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.756900601
Minimum0
Maximum5
Zeros4487
Zeros (%)4.3%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.398929473
Coefficient of variation (CV)0.5074283318
Kurtosis-0.9103462085
Mean2.756900601
Median Absolute Deviation (MAD)1
Skewness-0.01829427334
Sum286453
Variance1.957003669
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
324449
23.5%
224021
23.1%
419571
18.8%
117525
16.9%
513851
13.3%
04487
 
4.3%
ValueCountFrequency (%)
04487
 
4.3%
117525
16.9%
224021
23.1%
324449
23.5%
419571
18.8%
513851
13.3%
ValueCountFrequency (%)
513851
13.3%
419571
18.8%
324449
23.5%
224021
23.1%
117525
16.9%
04487
 
4.3%

Gate location
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.976882507
Minimum0
Maximum5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.27762101
Coefficient of variation (CV)0.4291808653
Kurtosis-1.030283299
Mean2.976882507
Median Absolute Deviation (MAD)1
Skewness-0.05888941158
Sum309310
Variance1.632315446
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
328577
27.5%
424426
23.5%
219459
18.7%
117562
16.9%
513879
13.4%
01
 
< 0.1%
ValueCountFrequency (%)
01
 
< 0.1%
117562
16.9%
219459
18.7%
328577
27.5%
424426
23.5%
513879
13.4%
ValueCountFrequency (%)
513879
13.4%
424426
23.5%
328577
27.5%
219459
18.7%
117562
16.9%
01
 
< 0.1%

Food and drink
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.202128888
Minimum0
Maximum5
Zeros107
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.329532711
Coefficient of variation (CV)0.4152027471
Kurtosis-1.145453205
Mean3.202128888
Median Absolute Deviation (MAD)1
Skewness-0.151279497
Sum332714
Variance1.767657229
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
424359
23.4%
522313
21.5%
322300
21.5%
221988
21.2%
112837
12.4%
0107
 
0.1%
ValueCountFrequency (%)
0107
 
0.1%
112837
12.4%
221988
21.2%
322300
21.5%
424359
23.4%
522313
21.5%
ValueCountFrequency (%)
522313
21.5%
424359
23.4%
322300
21.5%
221988
21.2%
112837
12.4%
0107
 
0.1%

Online boarding
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.250375346
Minimum0
Maximum5
Zeros2428
Zeros (%)2.3%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.349508954
Coefficient of variation (CV)0.4151855739
Kurtosis-0.7020058043
Mean3.250375346
Median Absolute Deviation (MAD)1
Skewness-0.4538516953
Sum337727
Variance1.821174416
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
430762
29.6%
321804
21.0%
520713
19.9%
217505
16.8%
110692
 
10.3%
02428
 
2.3%
ValueCountFrequency (%)
02428
 
2.3%
110692
 
10.3%
217505
16.8%
321804
21.0%
430762
29.6%
520713
19.9%
ValueCountFrequency (%)
520713
19.9%
430762
29.6%
321804
21.0%
217505
16.8%
110692
 
10.3%
02428
 
2.3%

Seat comfort
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.439395981
Minimum0
Maximum5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.319087519
Coefficient of variation (CV)0.3835230157
Kurtosis-0.9257020682
Mean3.439395981
Median Absolute Deviation (MAD)1
Skewness-0.4827753882
Sum357367
Variance1.739991882
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
431765
30.6%
526470
25.5%
318696
18.0%
214897
14.3%
112075
 
11.6%
01
 
< 0.1%
ValueCountFrequency (%)
01
 
< 0.1%
112075
 
11.6%
214897
14.3%
318696
18.0%
431765
30.6%
526470
25.5%
ValueCountFrequency (%)
526470
25.5%
431765
30.6%
318696
18.0%
214897
14.3%
112075
 
11.6%
01
 
< 0.1%

Inflight entertainment
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.35815753
Minimum0
Maximum5
Zeros14
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.332990715
Coefficient of variation (CV)0.3969410913
Kurtosis-1.060695752
Mean3.35815753
Median Absolute Deviation (MAD)1
Skewness-0.3651305877
Sum348926
Variance1.776864245
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
429423
28.3%
525213
24.3%
319139
18.4%
217637
17.0%
112478
12.0%
014
 
< 0.1%
ValueCountFrequency (%)
014
 
< 0.1%
112478
12.0%
217637
17.0%
319139
18.4%
429423
28.3%
525213
24.3%
ValueCountFrequency (%)
525213
24.3%
429423
28.3%
319139
18.4%
217637
17.0%
112478
12.0%
014
 
< 0.1%

On-board service
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.382362565
Minimum0
Maximum5
Zeros3
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.288354361
Coefficient of variation (CV)0.3809036837
Kurtosis-0.8923352438
Mean3.382362565
Median Absolute Deviation (MAD)1
Skewness-0.4200307451
Sum351441
Variance1.659856959
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
430867
29.7%
523648
22.8%
322833
22.0%
214681
14.1%
111872
 
11.4%
03
 
< 0.1%
ValueCountFrequency (%)
03
 
< 0.1%
111872
 
11.4%
214681
14.1%
322833
22.0%
430867
29.7%
523648
22.8%
ValueCountFrequency (%)
523648
22.8%
430867
29.7%
322833
22.0%
214681
14.1%
111872
 
11.4%
03
 
< 0.1%

Leg room service
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.35105482
Minimum0
Maximum5
Zeros472
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median4
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.315604619
Coefficient of variation (CV)0.3925941801
Kurtosis-0.9802569111
Mean3.35105482
Median Absolute Deviation (MAD)1
Skewness-0.3502313446
Sum348188
Variance1.730815514
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
428789
27.7%
524667
23.7%
320098
19.3%
219525
18.8%
110353
 
10.0%
0472
 
0.5%
ValueCountFrequency (%)
0472
 
0.5%
110353
 
10.0%
219525
18.8%
320098
19.3%
428789
27.7%
524667
23.7%
ValueCountFrequency (%)
524667
23.7%
428789
27.7%
320098
19.3%
219525
18.8%
110353
 
10.0%
0472
 
0.5%

Baggage handling
Categorical

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
4
37383 
5
27131 
3
20632 
2
11521 
1
7237 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters103904
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row3
3rd row4
4th row3
5th row4

Common Values

ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Most occurring characters

ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number103904
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Most occurring scripts

ValueCountFrequency (%)
Common103904
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII103904
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
437383
36.0%
527131
26.1%
320632
19.9%
211521
 
11.1%
17237
 
7.0%

Checkin service
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.304290499
Minimum0
Maximum5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.265395827
Coefficient of variation (CV)0.382955381
Kurtosis-0.8288770565
Mean3.304290499
Median Absolute Deviation (MAD)1
Skewness-0.3649819608
Sum343329
Variance1.601226599
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
429055
28.0%
328446
27.4%
520619
19.8%
212893
12.4%
112890
12.4%
01
 
< 0.1%
ValueCountFrequency (%)
01
 
< 0.1%
112890
12.4%
212893
12.4%
328446
27.4%
429055
28.0%
520619
19.8%
ValueCountFrequency (%)
520619
19.8%
429055
28.0%
328446
27.4%
212893
12.4%
112890
12.4%
01
 
< 0.1%

Inflight service
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.640427702
Minimum0
Maximum5
Zeros3
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.175663034
Coefficient of variation (CV)0.3229464035
Kurtosis-0.3575091976
Mean3.640427702
Median Absolute Deviation (MAD)1
Skewness-0.6903139573
Sum378255
Variance1.382183569
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
437945
36.5%
527116
26.1%
320299
19.5%
211457
 
11.0%
17084
 
6.8%
03
 
< 0.1%
ValueCountFrequency (%)
03
 
< 0.1%
17084
 
6.8%
211457
 
11.0%
320299
19.5%
437945
36.5%
527116
26.1%
ValueCountFrequency (%)
527116
26.1%
437945
36.5%
320299
19.5%
211457
 
11.0%
17084
 
6.8%
03
 
< 0.1%

Cleanliness
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.286350862
Minimum0
Maximum5
Zeros12
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.312272847
Coefficient of variation (CV)0.3993100256
Kurtosis-1.012557651
Mean3.286350862
Median Absolute Deviation (MAD)1
Skewness-0.3000744927
Sum341465
Variance1.722060025
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
427179
26.2%
324574
23.7%
522689
21.8%
216132
15.5%
113318
12.8%
012
 
< 0.1%
ValueCountFrequency (%)
012
 
< 0.1%
113318
12.8%
216132
15.5%
324574
23.7%
427179
26.2%
522689
21.8%
ValueCountFrequency (%)
522689
21.8%
427179
26.2%
324574
23.7%
216132
15.5%
113318
12.8%
012
 
< 0.1%

Departure Delay in Minutes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct446
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.81561826
Minimum0
Maximum1592
Zeros58668
Zeros (%)56.5%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q312
95-th percentile78
Maximum1592
Range1592
Interquartile range (IQR)12

Descriptive statistics

Standard deviation38.23090058
Coefficient of variation (CV)2.580445845
Kurtosis100.2670058
Mean14.81561826
Median Absolute Deviation (MAD)0
Skewness6.73397951
Sum1539402
Variance1461.601759
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
058668
56.5%
12948
 
2.8%
22274
 
2.2%
32009
 
1.9%
41854
 
1.8%
51692
 
1.6%
61517
 
1.5%
71392
 
1.3%
81295
 
1.2%
91255
 
1.2%
Other values (436)29000
27.9%
ValueCountFrequency (%)
058668
56.5%
12948
 
2.8%
22274
 
2.2%
32009
 
1.9%
41854
 
1.8%
51692
 
1.6%
61517
 
1.5%
71392
 
1.3%
81295
 
1.2%
91255
 
1.2%
ValueCountFrequency (%)
15921
< 0.1%
13051
< 0.1%
10171
< 0.1%
9781
< 0.1%
9331
< 0.1%
9301
< 0.1%
9211
< 0.1%
8591
< 0.1%
8531
< 0.1%
7501
< 0.1%

Arrival Delay in Minutes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct455
Distinct (%)0.4%
Missing310
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean15.1786783
Minimum0
Maximum1584
Zeros58159
Zeros (%)56.0%
Negative0
Negative (%)0.0%
Memory size811.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q313
95-th percentile79
Maximum1584
Range1584
Interquartile range (IQR)13

Descriptive statistics

Standard deviation38.69868202
Coefficient of variation (CV)2.549542276
Kurtosis94.5370055
Mean15.1786783
Median Absolute Deviation (MAD)0
Skewness6.596636807
Sum1572420
Variance1497.58799
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
058159
56.0%
12211
 
2.1%
22064
 
2.0%
31952
 
1.9%
41907
 
1.8%
51658
 
1.6%
61616
 
1.6%
71481
 
1.4%
81394
 
1.3%
91264
 
1.2%
Other values (445)29888
28.8%
ValueCountFrequency (%)
058159
56.0%
12211
 
2.1%
22064
 
2.0%
31952
 
1.9%
41907
 
1.8%
51658
 
1.6%
61616
 
1.6%
71481
 
1.4%
81394
 
1.3%
91264
 
1.2%
ValueCountFrequency (%)
15841
< 0.1%
12801
< 0.1%
10111
< 0.1%
9701
< 0.1%
9521
< 0.1%
9241
< 0.1%
9201
< 0.1%
8601
< 0.1%
8231
< 0.1%
7291
< 0.1%

satisfaction
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size811.9 KiB
neutral or dissatisfied
58879 
satisfied
45025 

Length

Max length23
Median length23
Mean length16.93334232
Min length9

Characters and Unicode

Total characters1759442
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowneutral or dissatisfied
2nd rowneutral or dissatisfied
3rd rowsatisfied
4th rowneutral or dissatisfied
5th rowsatisfied

Common Values

ValueCountFrequency (%)
neutral or dissatisfied58879
56.7%
satisfied45025
43.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
dissatisfied58879
26.6%
or58879
26.6%
neutral58879
26.6%
satisfied45025
20.3%

Most occurring characters

ValueCountFrequency (%)
i266687
15.2%
s266687
15.2%
e162783
9.3%
t162783
9.3%
a162783
9.3%
d162783
9.3%
r117758
6.7%
117758
6.7%
f103904
 
5.9%
n58879
 
3.3%
Other values (3)176637
10.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1641684
93.3%
Space Separator117758
 
6.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i266687
16.2%
s266687
16.2%
e162783
9.9%
t162783
9.9%
a162783
9.9%
d162783
9.9%
r117758
7.2%
f103904
 
6.3%
n58879
 
3.6%
u58879
 
3.6%
Other values (2)117758
7.2%
Space Separator
ValueCountFrequency (%)
117758
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1641684
93.3%
Common117758
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
i266687
16.2%
s266687
16.2%
e162783
9.9%
t162783
9.9%
a162783
9.9%
d162783
9.9%
r117758
7.2%
f103904
 
6.3%
n58879
 
3.6%
u58879
 
3.6%
Other values (2)117758
7.2%
Common
ValueCountFrequency (%)
117758
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1759442
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i266687
15.2%
s266687
15.2%
e162783
9.3%
t162783
9.3%
a162783
9.3%
d162783
9.3%
r117758
6.7%
117758
6.7%
f103904
 
5.9%
n58879
 
3.3%
Other values (3)176637
10.0%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

Unnamed: 0idGenderCustomer TypeAgeType of TravelClassFlight DistanceInflight wifi serviceDeparture/Arrival time convenientEase of Online bookingGate locationFood and drinkOnline boardingSeat comfortInflight entertainmentOn-board serviceLeg room serviceBaggage handlingCheckin serviceInflight serviceCleanlinessDeparture Delay in MinutesArrival Delay in Minutessatisfaction
0070172MaleLoyal Customer13Personal TravelEco Plus460343153554344552518.0neutral or dissatisfied
115047Maledisloyal Customer25Business travelBusiness2353233131115314116.0neutral or dissatisfied
22110028FemaleLoyal Customer26Business travelBusiness11422222555543444500.0satisfied
3324026FemaleLoyal Customer25Business travelBusiness56225552222253142119.0neutral or dissatisfied
44119299MaleLoyal Customer61Business travelBusiness2143333455334433300.0satisfied
55111157FemaleLoyal Customer26Personal TravelEco11803421121134444100.0neutral or dissatisfied
6682113MaleLoyal Customer47Personal TravelEco127624232222334352923.0neutral or dissatisfied
7796462FemaleLoyal Customer52Business travelBusiness20354344555555545440.0satisfied
8879485FemaleLoyal Customer41Business travelBusiness8531222433112141200.0neutral or dissatisfied
9965725Maledisloyal Customer20Business travelEco10613334233223443200.0neutral or dissatisfied

Last rows

Unnamed: 0idGenderCustomer TypeAgeType of TravelClassFlight DistanceInflight wifi serviceDeparture/Arrival time convenientEase of Online bookingGate locationFood and drinkOnline boardingSeat comfortInflight entertainmentOn-board serviceLeg room serviceBaggage handlingCheckin serviceInflight serviceCleanlinessDeparture Delay in MinutesArrival Delay in Minutessatisfaction
10389410389486549MaleLoyal Customer26Business travelBusiness712444455553443451726.0satisfied
10389510389566030Femaledisloyal Customer24Business travelEco1055111211113355411310.0neutral or dissatisfied
10389610389671445MaleLoyal Customer57Business travelEco8674555444434313400.0neutral or dissatisfied
103897103897102203FemaleLoyal Customer60Business travelBusiness15995555554444444497.0satisfied
10389810389860666MaleLoyal Customer50Personal TravelEco16203134232243424200.0neutral or dissatisfied
10389910389994171Femaledisloyal Customer23Business travelEco1922123222231423230.0neutral or dissatisfied
10390010390073097MaleLoyal Customer49Business travelBusiness23474444245555555400.0satisfied
10390110390168825Maledisloyal Customer30Business travelBusiness199511134154324554714.0neutral or dissatisfied
10390210390254173Femaledisloyal Customer22Business travelEco10001115111145154100.0neutral or dissatisfied
10390310390362567MaleLoyal Customer27Business travelBusiness17231333111111443100.0neutral or dissatisfied